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A New Regularity Lemma and 
Faster Approximation Algorithms for Low Threshold Rank Graphs 

Shayan Oveis Gharan* Luca Trevisan^ 



f^*) . Abstract 

Kolla and Tulsiani [KT07, Kolll] and Arora, Barak and Steurer [ABS10] introduced the 
technique of subspace enumeration, which gives approximation algorithms for graph problems 

Qsuch as unique games and small set expansion; the running time of such algorithms is exponential 
in the threshold-rank of the graph. 

Guruswami and Sinop [GS11, GS12], and Barak, Raghavendra, and Steurer [BRS11] devel- 
oped an alternative approach to the design of approximation algorithms for graphs of bounded 
threshold-rank, based on semidefinite programming relaxations in the Lassere hierarchy and 
on novel rounding techniques. These algorithms are faster than the ones based on subspace 
enumeration and work on a broad class of problems. 

In this paper we develop a third approach to the design of such algorithms. We show, 
constructively, that graphs of bounded threshold-rank satisfy a weak Szemeredi regularity lemma 
analogous to the one proved by Frieze and Kannan [FK99] for dense graphs. The existence of 
efficient approximation algorithms is then a consequence of the regularity lemma, as shown by 
Frieze and Kannan. 

C*") ■ Applying our method to the Max Cut problem, we devise an algorithm that is faster than 

all previous algorithms, and is easier to describe and analyze. 



1 Introduction 



Kolla and Tulsiani [KT07, Kolll] and Arora, Barak and Steurer [ABS10] proved that the Unique 
Games problem can be approximated efficiently if the adjacency matrix of a graph associated with 
^ ■ the problem has few large eigenvalues; they show that, for every optimal solution, its indicator 

vector is close to the subspace spanned by the eigenvectors of the large eigenvalues, and one can 
find a solution close to an optimal one by enumerating an e-net for such a subspace. Such subspace 
enumeration algorithm runs in time exponential in the dimension of the subspace, which is the 
number of large eigenvalues; such a parameter is called the threshold rank of the graph. Arora, 
Barak and Steurer show that the subspace enumeration algorithm can approximate other graph 
problems, in regular graphs, in time exponential in the threshold rank, including the Uniform 
Sparsest Cut problem, the Small-Set Expansion problem and the Max Cut problem 1 . 



* Department of Management Science and Engineering, Stanford University. Supported by a Stanford Graduate 
Fellowship. Email: shayanOst anf ord. edu. 

^Department of Computer Science, Stanford University. This material is based upon work supported by the 
National Science Foundation under grant No. CCF 1017403. Email:trevisan@stani ord.edu. 

1 The subspace enumeration algorithm does not improve the 0.878 approximation guarantee of Goemans, 
Williamson [GW95], but it finds a solution of approximation factor 1 — 0(e) if the optimum cuts at least 1 — e. 
fraction of edges. 
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Barak, Raghavendra and Steurer [BRSll] and Guruswami and Sinop [GSll, GS12, GS13] devel- 
oped an alternative approach to the design of approximation algorithms running in time exponential 
in the threshold rank. Their algorithms are based on solving semidefinite programming relaxations 
from the Lasserre hierarchy and then applying sophisticated rounding schemes. The advantage 
of this approach is that it is applicable to a more general class of graph problems and constraint 
satisfaction problems, that the approximation guarantee has a tighter dependency on the threshold 
used in the definition of threshold rank and that, in same cases, the algorithms have a running time 
of f(k, e) • where k is the threshold rank and 1 ± e is the approximation guarantee, instead of 
the running time of n°( k ' which follows from an application of the subspace enumeration algorithm 
for constant e. 

In this paper we introduce a third approach to designing algorithms for graphs of bounded 
threshold rank, which is based on proving a weak Szemeredi regularity lemma for such graphs. 

The regularity lemma of Szemeredi [Sze78] states that every dense graph can be well approx- 
imated by the union of a constant number of bipartite complete subgraphs; the constant, how- 
ever, has a tower-of-exponentials dependency on the quality of approximation. Frieze and Kannan 
[FK96, FK99] prove what they call a weak regularity lemma, showing that every dense graph can be 
approximated up to an error en 2 in the cut norm by a linear combination of 0(1/ e 2 ) cut matrices 
(a cut matrix is a bipartite complete subgraph) with bounded coefficients. Frieze and Kannan also 
show that such an approximation can be constructed "implicitly" in time polynomial in 1/e and 
that, for a weighted graph which is a linear combination of a cut matrices, several graph prob- 
lems can be approximated in time exp(0(<r)) + poly(ra) time. Combining the two facts one has a 
exp(poly(l/e)) + poly(n) time approximation algorithm for many graph problems on dense graphs. 

We prove that a weak regularity lemma holds for all graphs of bounded threshold rank. Our 
result is a proper generalization of the weak regularity lemma of Frieze and Kannan, because dense 
graphs are known to have bounded threshold rank 2 . For a (weighted) G = (V, E) with adjacency 
matrix A, and diagonal matrix of vertex degrees D, D^ 1 / 2 AD^ 1 / 2 is called the normalized adjacency 
matrix of G. If the square sum of the eigenvalues of the normalized adjacency matrix outside the 
range [— e/2,e/2] is equal to k (in particular, if there are at most k such eigenvalues), then we show 
that there is a linear combination of 0(k/e 2 ) cut matrices that approximate A up to 2e|-E'| in cut 
norm; furthermore, such a decomposition can be found in poly (n, k, 1/e) time. (See Theorem 2.3 
below.) Our regularity lemma, combined with an improvement of the Frieze-Kannan approximation 
algorithm for graphs that are linear combination of cut matrices, gives us algorithms of running 
time 2 0<yk J//f ) + poly(n) for several graph problems on graphs of threshold rank k, providing an 
additive approximation of 2e|22|. In problems such as Max Cut in which the optimum is f](|i?|), 
this additive approximation is equivalent to a multiplicative approximation. 



Reference 
[BRSll]* 
[GSll] 
[GS12] 


Running time 

2 0(fc/e4) . po ly( n ) 

2 fcA 3 . n O(l/e) 


Parameter k 

# of eigenvalues not in range [— c • e 2 , c • e 2 ], c > 

# of eigenvalues < — e/2 

# of eigenvalues < — e 2 /2 


this paper 


2 0(* 1 -V« 3 ) + poly(n) 


sum of squares of eigenvalues not in range [— e/8, e/8] 



Table 1: A comparison between previous algorithms applied to Max Cut and our algorithm. 
[BRSll] needs to solve r rounds of Lasserre hierarchy, for r = 0(k/e 4 ). 

2 The normalization used for dense graphs is different. 
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Table 1 gives a comparison between previous algorithms applied to Max Cut and our algorithm. 
The advantages over previous algorithms, besides the simplicity of the algorithm, is a faster running 
time and the dependency on a potentially smaller theshold-rank parameter, because the running 
time of our algorithm depends on the sum of squares of eigenvalues outside of a certain range, 
rather than the number of such eigenvalues, (recall that the eigenvalues of D~ 1 / 2 AD~ 1 / 2 are in 
the range [—1,1].) 

We now give a precise statement of our results, after introducing some notation. 



2 Statement of Results 
2.1 Notations 

Let G = (V,E) be a (weighted) undirected graph with n := |V| vertices. Let A be the adjacency 
matrix of G. For a vertex u £ V, let d(u) := Ylv~u A(u, v) be the degree of u. For a set S C V, let 
d(S) = J2veS d( v ) be the summation of vertex degrees in S, and and let m := 2\E\ = d(V). Let D 
be the diagonal matrix of degrees. For any matrix M G M VxV , we use Mp to denote the symmetric 
matrix D~ l l 2 MD~ l l 2 . We call Ax> the normalized adjacency matrix of G. It is straightforward to 
see that all eigenvalues of Ajy is contained in the interval [—1,1]. 

For two functions /, <7 G MY , let (/, g) := Ylvev f( v )9( v )- Also, let f ® g be the tensor product 
of f,g; i.e., the matrix in R^ x ^ such that (it, u) entry is f{u)g{v). For a function / G M. v , and 
S^Vlet f(S):=J2 veS f(v). 

For a set S C 1/, let I5 be the indicator function of S, and let 



d s (v) 



d{v) v£ S 

otherwise. 



For any two sets S,T CV, and a E M, we use the notation CUT(5, T, a) := acis (8) to denote the 
matrix corresponding to the cut (S, T), where (it, v) entry of the matrix is ad(u)d(v) if u G S, v € T 
and zero otherwise. We remark that CUT(S', T, a) is not necessarily a symmetric matrix. 

Definition 2.1 (Matrix Norms). For a matrix M G R VxV , and S,T QV, let 

M(S,T) := M w 

ues,ver 

The Frobenius norm and the cut norm are defined as follows: 



\\ M \\ F ■= jE m ^> 

\\M\\n ■= max \M(S,T)\ 

° S,TCV 

Definition 2.2 (Sum-Squares Threshold Rank). For any unweighted graph G, with normalized 
adjacency matrix Ax>, let Ai, . . . , A n be the eigenvalues of Ajy with the corresponding eigenf unctions 
fi,...,f n . For 5 > 0, the 5 sum-squares threshold rank of A is defined as 

t s {A v ):= X l 

i:\Xi\>5 
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Also, the 6 threshold approximation of A is defined as, 

T S (A V ):= Y, 

i:|Aj|><5 

2.2 Matrix Decomposition Theorem 

The following matrix decomposition theorem is the main technical result of this paper. 

Theorem 2.3. For any graph G, and e > 0, let k := t e /2{Aj)). There is a algorithm that writes A 
as a linear combination of cut matrices, W^, W^, . . . , , such that a < 16k / e 2 , and 



A - W {1) - ... - 



< em, 

c 



where each is a cut matrix CVT(S,T,a), for some S,T C V, such that \a\ < \fkjm. The 
running time of the algorithm is polynomial in n, k, 1/e. 

2.3 Algorithmic Applications 

Our main algorithmic application of Theorem 2.3 is the following theorem that approximates any 
cut on low threshold rank graphs with a running time 2°^ k ' e ' + poly(n). 

Theorem 2.4. Let G = (V,E), and for a given < e, let k := t e / 8 (Ax>). There is a random- 
ized algorithm such that for any maximization or minimization problem on sets of size T in time 
2°( k ' e ) + poly(n) finds a random set S such that \d(S) — T\ < em, and with constant probability 
for any S* of size d(S*) = T, 

A(S,S) > A{S*,S^)-em 
if it is a maximization problem, otherwise, 

A(S,S) < A(S*,S^) + em. 

We can use the above theorem to provide a PTAS for maximum cut, maximum bisection, and 
minimum bisection problems. 

Corollary 2.5. Let G = (V,E), and for a given e > 0, let k := t^/^Aj})- There is a randomized 
algorithm that in time 2°( fcl ' 5 / <E3 ) _|_ p ly(n) finds an em additive approximation of the maximum 
cut. 

Proof. We can simply guess the size of the optimum within an em/2 additive error and then use 
Theorem 2.4. □ 

Corollary 2.6. Let G = (V,E), and for a given e > 0, let k := t e /%{Ai>). For any of the 

maximum bisection and minimum bisection problems, there is a randomized algorithm that in time 
2°( k / e ) -\- poly(ra) finds a cut (S,S) such that \d(S) — m/2\ < em and that A(S,S) provides an 
em additive approximation of the optimum. 

Proof. For the maximum/minimum bisection the optimum must have size m/2. So we can simply 
use Theorem 2.4 with T = m/2. □ 
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3 Regularity Lemma for Low Threshold Rank Graphs 

In this section we prove Theorem 2.3. The first step is to approximate A by a low rank matrix B. 
In the next lemma we construct B such that the value of any cut in A is approximated within an 
small additive error in B. 

Lemma 3.1. Let A be the adjacency matrix of G. For < 5 < 1, let 

B ■= D 1 ^ 2 Ts(Ad)D 1 ^ 2 . 

Then, \\A - B\\ c < 5m. 

Proof. Let Ai , . . . , X n be the eigenvalues of Ax> , with the corresponding eigenfunctions f\ , . . . , f n . 
For any S, T C V, we have 

(1 S ,(A-B)1 T ) = (D^Is^A-B^D^It) 
= {^d~ s ,(Av-T s {A v ))^) 

< 5 {Vds,fi)(V^,fi) 

i:|A»|<5 



V i:\\i\<8 Vi:|A,|<5 



< 6 y ds V d-T 



2 



< S U/ dv\\ = <5m, 



where the second inequality follows by the Cauchy-Schwarz inequality. The lemma follows by noting 
the fact that ||j4 — B\\ c is the maximum of the above expression for any S,T CV. □ 

By the above lemma if we approximate B by a linear combination of cut matrices, that also is 
a good approximation of A. Moreover, since ts(Ax>) = ts(Bx>), B has a small sum-square threshold 
rank iff A has a small threshold rank. 

Lemma 3.2. For any graph G with adjacency matrix A, and 5 > ; let B := D 1 / 2 Ts(A-£>)D 1 / 2 . 
Then, 

\\Bv\\ 2 F = t s (A v ). 



Proof. The lemma follows from the fact that the frobenius norm of any matrix is equal to the 
summation of square of eigenvalues. If Ai, . . . , A n are the eigenvalues of Ax>, then 

\\B\\ 2 F = traced 2 = £ X 2 = t & (A v ) . □ 

|Ai|><5 

The next proposition is the main technical part of the proof of Theorem 2.3. We show that we 
can write any (not necessarily symmetric) matrix B as a linear combination of Odl^H^ /e 2 ) cut 
matrices such that the cut norm of B is preserved within an additive error of em. The proof builds 
on the existential theorem of Frieze and Kanan [FK99, Theorem 7]. 
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Proposition 3.3. For any matrix B £ M. VxV , k = \\Bx>\\p, and e > 0, there exist cut matrices 
W^ l \W {2 \. . such that a < 1/e 2 , and for all S,T QV, 

' B - - - ... - iyW] (S, T) < eVky/d(S)d(T), 

where each is a cut matrix CUT(5, T, a), for some S,T QV, and qGK. 

Proof. Let = B. We use the potential function h(R) := We show that while \\R\\ C > 

eVkm, we can choose cut matrices iteratively while maintaining the invariant that each time the 
value of the potential function decreases by at least e 2 h{B). Since h(R^) = h(B), after at most 
1/e 2 we obtain a good approximation of B. 

Assume that after t < 1/e 2 iterations, HM = B — W^ — ... — W®. Suppose for some S,T <^V, 

R {i \S,T) > eh(B) v / d(S)d(T). (1) 

Choose = CUT(S,T,a), for a = (S, T) / d(S)d(T), and let R^ = flW - W^ i+1 \ We 

have, 

h(R^)-h(R^)= Y, — ~ a Tu\ d d(v) 2 ~ ~ = - 2 aR il \S,T) + a 2 d(S)d(T) 

-RV(S,T) 2 ^ 



d{S)d{T) 



< -e 2 $ 2 (B), 



where the second to last equation follows from the definition of a, and the last equation follows 
from equation (1). Therefore, after at most a < 1/e 2 iterations, (1) must hold for all S, T C V. □ 

Although the previous proposition only proves the existence of a decomposition into cut matri- 
ces, we can construct such a decomposition efficiently using the following nice result of Alon and 
Naor [AN06] that gives a consant factor approximation algorithm for the cut norm of any matrix. 

Theorem 3.4 (Alon and Naor [AN06]). There is a polynomial time randomized algorithm such 
that for any given A E R 1/x ^ / , with high probability, finds sets S, T C V, such that 

\A(S,T)\ > 0.56 \\A\\ C . 

Now we are ready to prove Theorem 2.3. 

Proof of Theorem 2.3. Let 6 := e/2, and B := D 1 / 2 T S {A V )D 1 / 2 . By Lemma 3.1, we have that 
1 1 .A — B\\q < em/2. So we just need to approximate B by a set of cut matrices within an additive 
error of em/2. For a matrix R, let h(R) := ||jRd||p. By Lemma 3.2 we have h(B) = \fk. 

Let e' := e/V^k. We use the proof strategy of Proposition 3.3. Let i?W = B - W {1) - ... - W {i) . 
If U-R^ll^ > e'Vkm, then by Theorem 3.4 in polynomial time we can find S, T C V such that 

R {i) (S,T) > e'Vkm/2 > e'h{B)m/2. (2) 

Choose W {i+ V = CUT(5,r,a), for a = R ( - i \S,T)/m 2 , and let R^ = fl« - W^ i+1 l We get, 

h(R^) - h(R®) = -2aR^(S,T) + a 2 d(S)d(T) < _ rM ( S > T ) 2 < 

mr 4 
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bmce h(R^) = h(B), after a < 4/e' 2 = I6k/e 2 , we have < e'\fkm, which implies that 



A - W {1) 



c 



< \\A-B\\ C + 



B - W {1) 



c 



< em. 



This proves the correctness of the algorithm. It remains to upper bound a. For each cut matrix 
W^) = CUT(iS, T, a) constructed throughout the algorithm we have 



a 



\R®(S,T)\ 



m^ 



m- 



R (i) \fd{u)d{v) 
ues,ver U,V ^/d{u)d{v) 



< 



o(0 



m 2 \ ^— ' d(u)d(v) 
h(R®) h(B) _ Vk 



Vd(S)d(T) 



in 



m 



in 



where the first inequality follows by the Cauchy-Schwarz inequality, the second inequality uses 
d(S), d(T) < m, and the last inequality follows by the fact that the potential function is decreasing 
throughout the algorithm. This completes the proof of theorem. □ 



4 Fast Approximation Algorithm for Low Threshold Rank Graphs 

In this section we prove Theorem 2.4. First, by Theorem 2.3 in time poly(n, 1/e) we can find cut 
matrices . . . , for a = 0(k/e 2 ), such that for all 1 < i < t, = CUT (£;, T u a*), 

oti < \fk/m, and 

\\A- W\\ c < em/4, 

where W := + . . . + W( a > . It follows from the above equation that for any set S C V, 



\A(s,s)-w(s,s)\ = \A(s,s)-Y,»i d (Sns l )d(SnT l )\<^. (3) 

i=l 



Fix S* C V of size d(S*) = T (think of (S* , S*) as the optimum cut), and let s* := d(Si n S*), and 
t* := d(Ti Pi S*). Observe that by equation (3), 



A(S*,S*)-^2a lS *t* 



i=i 



< f ■ (4) 



Let a max := maxi<j< ff \ai\. Let A := Le/(48a max cr)J ; observe that A = 0(e 3 m/fc L5 ). We define an 
approximation of s*,t* by rounding them down to the nearest multiple of A, i.e., s* = A[s*/AJ, 
and t* = A|_t*/AJ. We use s*,t* to denote the vectors of the approximate values. It follows that 
we can obtain a good approximation of the size of the cut (S*, S*) just by guessing the vectors s*, 
and t*. Since \s* - s*\ < A and \t* - t*\ < A, we get, 



a 



} J \s*t*ai - s*t*ai\ < (ia max (2Am + A 2 ) < 3a max oA < em/16. (5) 



=i 



Observe that by equations (3), (4), (5), if we know the vectors s*,t*, then we can find A(S*, S*} 
within an additive error of em/2. Since s*,t* < m, there are only 0(m/A) possibilities for each s\ 
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and t*. Therefore, we afford to enumerate all possible values of them in time (m/A) 2cr , and choose 
the one that gives the largest cut. Unfortunately, for a given assignment of s*,t* the corresponding 
cut (S*,S*) may not exist. Next we give an algorithm that for a given assignment of s*,t* finds a 
cut (S, S) such that A(S, S) = J2i ^*i a i ^ em > if one exists. 

First we distinguish the large degree vertices of G and simply guess which side they are mapped 
to in the optimum cut. For the rest of the vertices we use the solution of LP(1). Let U := {v : 
d(v) > A} be the set of large degree vertices. Observe that \U\ < m/A. Let V be the coarsest 
partition of the set V \ U such that for any 1 < i < a, both Si\U and Ti\U can be written as a 
union of sets in V, and for each P e V, d(P) < A. Observe that \V\ < 2 2fJ + m/A. For a given 
assignment of s*,t*, first we guess the set of vertices in U that are contained in S*, Us* '■= S* n U, 
and U-gr '■= U \Us*- For the rest of the vertices we use the linear program LP(1) to find the 
unknown d(S* D P). 



LP(1) 

< y P 

r - em/2 < yP d ( p ) + d ( u s* 



Si 



< 1 VP G V 

< T + em/2 (6) 

< s* + A VI < i < a (7) 



PCS,; 



< ^-yp)d{P) + d{U w r\Ti) < t* + A \fl<i<a. (8) 

PCTi 

Observe that yp = ^rp) is a feasible solution to the linear program. In the next lemma which is 
the main technical part of the analysis we show how to construct a set based on a given solution 
of the LP. 

Lemma 4.1. There is a randomized algorithm such that for any S* C V, given s*,t* and Us* 
returns a random set S such that 



W{S, S) > A(S*,S*) - ^ A \d(S) - T| < em 
W(S, S) < A(S*,S*) + \d(S) - T\ < em 



e 

> — 
~ 10 

e 

> — . 
~ 10 



(9) 
(10) 



Proof. Let y be a feasible solution of LP(1). We use a simple independent rounding scheme to 
compute the random set S. We always include Us* in S. For each P G V, we include P in S, 
independently, with probability yp. We prove that S satisfies lemma's statements. First of all, by 
linearity of expectation, 

E [d(S n Si)} = d(U s *) + Y yp d ( p )> and E [d(snTi)] = d(u w ) + Y (! - 

PCSi PCTi 

In the following two claims, first we show that with high probability the expected size of d(S) 
is close to T. Then, we upper bound the expected value of W(S, S) — A(S*, S*). 

Claim 4.2. 

P[|d(5) - d(S*)\ > em] < |, 
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Proof. We use the theorem of Hoeffding to prove the claim: 

Theorem 4.3 (Hoeffding Inequality). Let X\, . . . ,X n be independent random variables such that 
for each 1 < i < n, X{ £ [0, o$]. Let X := Y17=i -^i- Then, for any e > 

P[LY-E[X]|>e]<2exp (~^— 2 

V 2-ri=l a i 

Now, by the independent rounding procedure, we obtain 



¥ [\d{S) - E [d{S)\ | > em/2] < 2 exp 



e 2 m 2 



< 2 exp 



f 2 m 2 \ f 

— ) < a«p(-i6/ e) < 3 . 



where the third inequality follows by the fact that d(P) < A and c/(P) < m. The claim follows 
from the fact that by (6), |E [d(S)} - T\< em/2. 

Claim 4.4. 



□ 



Proof. First, observe that 



E[W(S,S)] = E 



|E [W(S,S)] -A(S*,S*)\ < 



J2d(SnSi)d(SnTi)ai 



em 



4=1 



4=1 



( ]T d(P)I[PQS})( d(Q)l[QQS]) 
Pev-.PQSi Qev-.QCTi 



+ ai[d(u s * n s t )E [d(s n TO] + d(u w n P;)e [d(5 n 5<)]] 



4=1 



d(P)d(Q)M[l[PQS]l[QCS]'_ 

4=1 PCSi,QCTi 



+ J2ai{d(Us*nS i )t i + d(U-^nT i )s i } . 

4=1 

Since the event that P C 5 is independent of Q C S, iff P ^ Q we get 

E [I [P C 5] I [Q C 5]] 

Let Sj := E [d(S n «%)] and ^ := E [d(5 < n T$)l . Then, by (11) and above equation, 



(11) 



y P (l-y Q ) HP + Q 
otherwise. 



E[W(S,S)] = ^2a i s l t i -^2a i Y,yP^-yp)d(P? 

i=i PeV 



(12) 



4=1 



On the other hand, by equations (7) and (8), for all 1 < % < a, we get s* < Sj < s* + A and 
i* < < i* + A. Hence, similar to equation (5) we can show, 



em 



(13) 



4=1 
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Therefore, using equation (4) we get 



\E[W(S,S)]-A(S*,S*)\ < \E[W(S,S)]-Y,aiS*ti)\ + ™ 

i=l 

a a 

= | ^(uiSiU - ais*t*) - ^2^2 ^ypi 1 ~ yp) d ( p ) 

i=l i=i PeV 

a a 
< ^ \ a i s iU ~ a iS*i*i \ + ^2 \ a ^i^i ~ a i S i*i I + °"«max"lA + < -TT 



em 



i=l 



i=l 



em em 
— < — 
4 ~ 2 



where the equality follows by (12), the second inequality follows by the fact that d(P) < A for all 
P £ V and ^2pd(P) < m, and the last inequality follows by equations (13) and (5). This proves 
the claim. □ 

Now we are ready finish the proof of Lemma 4.1. Here, we prove (9). Equation (10) can be 
proved similarly. By Claim 4.4, 

A(S*,S*) - — <E[W{S,S)] < E [W(S, S) \ \d(S) - T\< em] + mP [\d(S) - T\> em] 

< E [W(S, S) | \d(S) - T| < em] + — . 

8 

where the second inequality holds by the fact that the size of any cut in G is at most m/2, thus 
by (3) for any S C V, W(S, S) < em/4 + m/2 < m, and the last inequality follows by Claim 4.2. 
Hence, 

E [W(S,S) | \d(S) - T| < em] > A(S*,S*) - ^ 



Since W(S, S) < m, 



W(S,S) > A(S*,S* 



3em 



\d(S) - T| < em 



> 



Therefore, (9) follows by an application of Claim 4.2. 



□ 



Our rounding algorithm is described in Algorithm 1. First, we prove the correctness, then we 
calculate the running time of the algorithm. Let S be the output set of the algorithm. First, observe 
that we always have \d(S) — T\ < em. Now let A(S*, S*) be the maximum cut among all sets of size 
r (the minimization case can be proved similarly). In the iteration that the algorithm correctly 
guesses s*,t*, Us*, there exists a feasible solution y of LP(1). by Lemma 4.1, for all 1 < i < 10/e, 



W(R y (i),R y (i))>A(S*,S*) 



3em 



A \d(Ry(i)) - T\ < em 



> 
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Since we take the best of 10/e samples, with probability 1/e the output set S satisfies W(S, S) > 
A(S*,S*) — 3em/4. Therefore, by (3), A(S,S) > A(S*,S*) — em. This proves the correctness of 
the algorithm. 

It remains to upper-bound the running time of the algorithm. First observe that if \U\ = 
0(k/e 2 ), the running time of the algorithm is dominated by the time it takes to compute a feasible 
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Algorithm 1 Approximate Maximum Cut (S, S) such that d(S) = T ± em 



for all possible values of s*,i*, and Us* Q U do 
if there is a feasible solution y of LP(1) then 
for i = 1 — > 10/e do 

For each PsP include P in R y (i), independently, with probability yp. 
end for 
end if 
end for 

return among all sets R y {i) sampled in the loop that satisfy \d{R y {i)) — T\ < em, the one that 
W(Ry(i),R y (i)) is the maximum. 



solution of LP(1). Since the size of LP is 2°^ k ^ 2 \ in this case Algorithm 1 terminates in time 
20(fc/e 2 )_ N t e that for any sample set Ry(i), both d(R y (i)) and W(R y (i), R y (i)) can be computed 
in time 2°^ k l f2 \ once we know \R y {i) n P\ for any P 6 V. 

Otherwise if \U\ 3> k/e 2 , the dependency of the running time of the algorithm to e, k is domi- 
nated by the step where we guess the subset of Us* = U(~)S* . Since a max < \fkjm and cr = 0(k/e 2 ), 
we get 

m < 12ma max (j _q( k 1 - 
A ~ e ~ V e 3 

Therefore, Algorithm 1 runs in time 2°( fcl ' J / e3 ). Since it takes poly(n, k, 1/e) to compute the de- 
composition into ^W,...,^W,the the total running time is 2°( fcl ' 5 / <E3 ) +poly(n). This completes 
the proof of Theorem 2.4. 
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